Dynamic Programming Algorithm Optimization for Spoken Word Recognition
نویسنده
چکیده
This paper reports on an optimum dynamic programming (DP) based time-normalization algorithm for spoken word recognition. First, a general principle of time-normalization i s given using timewarping function. Then, two time-normalized distance definitions, d e d symmetric and asymmetric forms, are derived from the principle. These two forms are compared with each other through theoretical discussions and experimental studies. The symmetric form algorithm superiority is established. A new technique, called slope constraint, is successfully introduced, in which the warping function slope is restricted so as to improve discrimination between words in different categories. The effective slope constraint characteristic is qualitatively analyzed, and the optimum slope constraint condition is determined through experiments. The optimized algorithm is then extensively subjected to experimentat comparison with various DP-algorithms, previously applied to spoken word recognition by different research groups. The experiment shows that the present algorithm gives no more than about twothirds errors, even compared to the best conventional algorithm. I
منابع مشابه
A New Optimization via Invasive Weeds Algorithm for Dynamic Facility Layout Problem
The dynamic facility layout problem (DFLP) is the problem of finding positions of departments onthe plant floor for multiple periods (material flows between departments change during the planning horizon)such that departments do not overlap, and the sum of the material handling and rearrangement costs isminimized. In this paper a new optimization algorithm inspired from colonizing weeds, Invasi...
متن کاملSpoken Digit Recognition by AugmentedContinuous DP Matching and its Evaluation
Recently, we proposed the Augmented Continuous dynamic time warping algorithm for connected spoken word recognition. The algorithm is based on the same principle as the Two Level DP and Level Building DP. Our algorithm obtains a near optimal solution for the recognition principle based on pattern matching. However, it is computationally more efficient than the conventional methods and does not ...
متن کاملTitle Connected Spoken Digit Recognition by Augmented Continuous DP Matching and its Evaluation
Recently, we proposed the Augmented Continuous dynamic time warping algorithm for connected spoken word recognition. The algorithm is based on the same principle as the Two Level DP and Level Building DP. Our algorithm obtains a near optimal solution for the recognition principle based on pattern matching. However, it is computationally more efficient than the conventional methods and does not ...
متن کاملA Level Building Dynamic Time Warping Algorithm for Connected Word Recognition
Dynamic time warping has been shown to be an effective method of handling variations in the time scale of polysyllabic words spoken in isolation. This class of techniques has recently been applied to connected word recognition with high degrees of success. In this paper a level building technique is proposed for optimally time aligning a sequence of connected words with a sequence of isolated w...
متن کاملIs Word Error Rate a Good Indicator for Spoken Language Understanding Accuracy
It is a conventional wisdom in the speech community that better speech recognition accuracy is a good indicator for better spoken language understanding accuracy, given a fixed understanding component. The findings in this work reveal that this is not always the case. More important than word error rate reduction, the language model for recognition should be trained to match the optimization ob...
متن کامل